Multi-Reader Multi-Case Studies Using the Area under the Receiver Operator Characteristic Curve as a Measure of Diagnostic Accuracy: Systematic Review with a Focus on Quality of Data Reporting
نویسندگان
چکیده
INTRODUCTION We examined the design, analysis and reporting in multi-reader multi-case (MRMC) research studies using the area under the receiver-operating curve (ROC AUC) as a measure of diagnostic performance. METHODS We performed a systematic literature review from 2005 to 2013 inclusive to identify a minimum 50 studies. Articles of diagnostic test accuracy in humans were identified via their citation of key methodological articles dealing with MRMC ROC AUC. Two researchers in consensus then extracted information from primary articles relating to study characteristics and design, methods for reporting study outcomes, model fitting, model assumptions, presentation of results, and interpretation of findings. Results were summarized and presented with a descriptive analysis. RESULTS Sixty-four full papers were retrieved from 475 identified citations and ultimately 49 articles describing 51 studies were reviewed and extracted. Radiological imaging was the index test in all. Most studies focused on lesion detection vs. characterization and used less than 10 readers. Only 6 (12%) studies trained readers in advance to use the confidence scale used to build the ROC curve. Overall, description of confidence scores, the ROC curve and its analysis was often incomplete. For example, 21 (41%) studies presented no ROC curve and only 3 (6%) described the distribution of confidence scores. Of 30 studies presenting curves, only 4 (13%) presented the data points underlying the curve, thereby allowing assessment of extrapolation. The mean change in AUC was 0.05 (-0.05 to 0.28). Non-significant change in AUC was attributed to underpowering rather than the diagnostic test failing to improve diagnostic accuracy. CONCLUSIONS Data reporting in MRMC studies using ROC AUC as an outcome measure is frequently incomplete, hampering understanding of methods and the reliability of results and study conclusions. Authors using this analysis should be encouraged to provide a full description of their methods and results.
منابع مشابه
Receiver Operating Characteristic (ROC) Curve Analysis for Medical Diagnostic Test Evaluation
This review provides the basic principle and rational for ROC analysis of rating and continuous diagnostic test results versus a gold standard. Derived indexes of accuracy, in particular area under the curve (AUC) has a meaningful interpretation for disease classification from healthy subjects. The methods of estimate of AUC and its testing in single diagnostic test and also comparative studies...
متن کاملDiagnostic accuracy of fecal calprotectin in assessing the severity of inflammatory bowel disease: From laboratory to clinic
Background: Inflammatory bowel disease (IBD) involves chronic inflammation of the digestive tract. In the past decades, fecal calprotectin has been proposed as a useful biomarker for the differential diagnosis between IBD patients and healthy controls. We designed this study to evaluate the diagnostic ability of fecal calprotectin (FC) and conventional inflammatory markers in IBD patients. M...
متن کاملApplication of adjusted-receiver operating characteristic curve analysis in combination of biomarkers for early detection of gestational diabetes mellitus
Introduction: In medical diagnostic field, evaluation of diagnostic accuracy of biomarkers or tests has always been a matter of concern. In some situations, one biomarker alone may not be sufficiently sensitive and specific for prediction of a disease. However, combining multiple biomarkers may lead to better diagnostic. The aim of this study was to assess the performance of combination of bio...
متن کاملمقایسه مدل درخت تصمیم و رگرسیون لوجستیک در ارزیابی پوکی استخوان
Introduction: Early detection of osteoporosis is a key to preventing of it; but recognition, without the use of appropriate diagnostic methods, due to the complexity of risk factors and gradual bone loss process, is problem. The purpose of this study is to develop and efficiency evaluation a predictive model of osteoporosis using decision tree technique as a diagnostic method based on available...
متن کاملReply to letter to the editor
To the Editor: I wish to comment on several methodological issues in a recent systematic review and meta-analysis on the diagnostic accuracy of ultrasound-guided core-needle biopsy for head and neck lesions. This study suffers from several problems that are commonly seen in meta-analyses of diagnostic accuracy studies. First, meta-analysis requires specialized statistical methods to synthesize ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 9 شماره
صفحات -
تاریخ انتشار 2014